Skip to content

fix(ci): migrate benchmarks to benchmarking-platform trigger#607

Draft
jbachorik wants to merge 26 commits into
mainfrom
jb/bench-bp-trigger
Draft

fix(ci): migrate benchmarks to benchmarking-platform trigger#607
jbachorik wants to merge 26 commits into
mainfrom
jb/bench-bp-trigger

Conversation

@jbachorik

Copy link
Copy Markdown
Collaborator

What does this PR do?:
Replaces the broken inline benchmark jobs with a benchmarking-platform bridge trigger. The old jobs ran the profiler inline via shell scripts and uploaded results to S3 themselves; the new approach delegates entirely to the BP pipeline at DataDog/apm-reliability/benchmarking-platform@java-profiler.

Motivation:
The existing benchmark CI was broken and unmaintained. The benchmarking-platform provides a reliable, standardised infra for running and tracking reliability benchmarks.

Additional Notes:

  • benchmarks-trigger is a GitLab bridge job — it cannot appear in needs: of downstream jobs, so post-benchmarks-pr-comment and publish-benchmark-gh-pages now run in a post-benchmarks stage ordered after benchmarks.
  • download-s3-reports.sh fetches reports uploaded by BP under s3://relenv-benchmarking-data/java-profiler/${CI_PIPELINE_ID}/.
  • The images.yml include is removed — the BP side manages its own Docker image build.
  • jb/bench-memory-limit-fix is superseded by this PR (the memory-heavy aarch64 jobs are gone).
  • The BP side implementation is in DataDog/benchmarking-platform PR Run tests on musl aarch64 #190.

How to test the change?:
Push to a branch and verify the GitLab pipeline shows a benchmarks-trigger bridge job that fires the BP downstream pipeline. The post-benchmarks-pr-comment and publish-benchmark-gh-pages jobs should run in the post-benchmarks stage after the trigger completes.

For Datadog employees:

  • This PR doesn't touch any of that.
  • JIRA: [JIRA-XXXX]

@jbachorik jbachorik added the AI label Jun 18, 2026
@datadog-datadog-prod-us1

This comment has been minimized.

@dd-octo-sts

dd-octo-sts Bot commented Jun 18, 2026

Copy link
Copy Markdown
Contributor

CI Test Results

Run: #27864315383 | Commit: db6613d | Duration: 14m 26s (longest job)

All 32 test jobs passed

Status Overview

JDK glibc-aarch64/debug glibc-amd64/debug musl-aarch64/debug musl-amd64/debug
8 - - -
8-ibm - - -
8-j9 - -
8-librca - -
8-orcl - - -
11 - - -
11-j9 - -
11-librca - -
17 - -
17-graal - -
17-j9 - -
17-librca - -
21 - -
21-graal - -
21-librca - -
25 - -
25-graal - -
25-librca - -

Legend: ✅ passed | ❌ failed | ⚪ skipped | 🚫 cancelled

Summary: Total: 32 | Passed: 32 | Failed: 0


Updated: 2026-06-20 07:49:13 UTC

@dd-octo-sts

dd-octo-sts Bot commented Jun 19, 2026

Copy link
Copy Markdown
Contributor

Benchmark Results

Pipeline: https://gitlab.ddbuild.io/DataDog/apm-reliability/benchmarking-platform/-/pipelines/120030223 Commit: 287647565feb9fc20032b357721816fbbb640962

Reliability Benchmarks

Benchmark JDK Latest Dev Δ% (dev vs latest) Issues L/D
akka-uct 21 ✅ 10125 ms (7 iters) ✅ 10340 ms (7 iters) 🔴 +2.1% — / —
akka-uct 25 ✅ 8997 ms (8 iters) ✅ 8955 ms (8 iters) -0.5% — / —
finagle-chirper 21 ✅ 5949 ms (11 iters) ✅ 6012 ms (11 iters) 1.1% ⚠️ W:1 / ⚠️ W:1
finagle-chirper 25 ✅ 5523 ms (12 iters) ✅ 5462 ms (12 iters) -1.1% ⚠️ W:1 / ⚠️ W:1
fj-kmeans 21 ✅ 2851 ms (22 iters) ✅ 2828 ms (22 iters) -0.8% — / —
fj-kmeans 25 ✅ 2865 ms (22 iters) ✅ 2872 ms (22 iters) 0.2% — / —
future-genetic 21 ✅ 2028 ms (31 iters) ✅ 2078 ms (30 iters) 🔴 +2.5% — / —
future-genetic 25 ✅ 1995 ms (31 iters) ✅ 1962 ms (32 iters) -1.7% — / —
naive-bayes 21 ✅ 1230 ms (46 iters) ✅ 1262 ms (45 iters) 🔴 +2.6% — / —
naive-bayes 25 ✅ 1035 ms (55 iters) ✅ 907 ms (62 iters) 🟢 -12.4% — / —
reactors 21 ✅ 15566 ms (5 iters) ✅ 15368 ms (6 iters) -1.3% — / —
reactors 25 ✅ 17898 ms (5 iters) ✅ 18181 ms (5 iters) 1.6% — / —

Internal Counters (latest / dev)

Final ddprof internal counters from the retained JFR (· = unavailable). All zeros = clean.

Benchmark JDK Dropped rec Dropped jvmti Dropped trace Skipped WC AGCT fail Unwind fail
akka-uct 21 · / · · / · · / · · / · · / · · / 0
akka-uct 25 0 / 0 0 / 0 2 / 3 2330 / 2206 0 / 0 0 / 0
finagle-chirper 21 · / · · / · · / · · / · · / · 0 / ·
finagle-chirper 25 0 / 0 0 / 0 5 / 1 8748 / 8304 0 / 0 0 / 0
fj-kmeans 21 · / · · / · · / · · / · · / · 0 / ·
fj-kmeans 25 0 / 0 0 / 0 1 / 6 1300 / 1296 0 / 0 0 / 0
future-genetic 21 · / · · / · · / · · / · · / · 0 / ·
future-genetic 25 0 / 0 0 / 0 2 / 1 2882 / 2914 0 / 0 0 / 0
naive-bayes 21 · / · · / · · / · · / · · / · · / 0
naive-bayes 25 0 / 0 0 / 0 4 / 4 3476 / 3456 0 / 0 0 / 0
reactors 21 0 / 0 0 / 0 0 / 0 1568 / 1798 0 / 0 0 / 0
reactors 25 0 / 0 0 / 0 0 / 0 1794 / 1820 0 / 0 0 / 0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant